Sequence-based estimation of minisatellite and microsatellite repeat variability.
Authors
Abstract
Variable tandem repeats are frequently used for genetic mapping, genotyping, and forensics studies. Moreover, variation in some repeats underlies rapidly evolving traits or certain diseases. However, mutation rates vary greatly from repeat to repeat, and as a consequence, not all tandem repeats are suitable genetic markers or interesting unstable genetic modules. We developed a model, "SERV," that predicts the variability of a broad range of tandem repeats in a wide range of organisms. The nonlinear model uses three basic characteristics of the repeat (number of repeated units, unit length, and purity) to produce a numeric "VARscore" that correlates with repeat variability. SERV was experimentally validated using a large set of different artificial repeats located in the Saccharomyces cerevisiae URA3 gene. Further in silico analysis shows that SERV outperforms existing models and accurately predicts repeat variability in bacteria and eukaryotes, including plants and humans. Using SERV, we demonstrate significant enrichment of variable repeats within human genes involved in transcriptional regulation, chromatin remodeling, morphogenesis, and neurogenesis. Moreover, SERV allows identification of known and candidate genes involved in repeat-based diseases. In addition, we demonstrate the use of SERV for the selection and comparison of suitable variable repeats for genotyping and forensic purposes. Our analysis indicates that tandem repeats used for genotyping should have a VARscore between 1 and 3. SERV is publicly available from http://hulsweb1.cgr.harvard.edu/SERV/.
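As a rough illustration of the three input characteristics named in the abstract (number of repeated units, unit length, and purity), the following Python sketch computes them for a single repeat tract. It does not reproduce the SERV model or the VARscore formula, neither of which is given here. The function names, the definition of purity as the fraction of bases matching a perfect tiling of the unit, and the inclusive treatment of the 1 to 3 range are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class RepeatFeatures:
    unit_length: int   # length of the repeated unit in bases
    num_units: float   # tract length divided by unit length
    purity: float      # fraction of bases matching a perfect repeat (assumed definition)


def repeat_features(sequence: str, unit: str) -> RepeatFeatures:
    """Compute three descriptive features of a tandem repeat tract.

    Hypothetical helper: purity is taken here as the share of positions
    that agree with a perfect head-to-tail tiling of `unit`. SERV may
    define purity differently.
    """
    if not sequence or not unit:
        raise ValueError("sequence and unit must be non-empty")
    sequence = sequence.upper()
    unit = unit.upper()
    # Build a perfect repeat of the same length as the observed tract.
    perfect = (unit * (len(sequence) // len(unit) + 1))[: len(sequence)]
    matches = sum(a == b for a, b in zip(sequence, perfect))
    return RepeatFeatures(
        unit_length=len(unit),
        num_units=len(sequence) / len(unit),
        purity=matches / len(sequence),
    )


def in_genotyping_range(varscore: float, low: float = 1.0, high: float = 3.0) -> bool:
    """Check a precomputed VARscore against the range suggested in the abstract.

    Assumption: both bounds are treated as inclusive.
    """
    return low <= varscore <= high


if __name__ == "__main__":
    # A CAG repeat with one interrupting base (CAA instead of CAG).
    print(repeat_features("CAGCAGCAACAG", "CAG"))
    # VARscore values would come from SERV itself; 2.0 is a made-up input.
    print(in_genotyping_range(2.0))
```

The VARscore passed to `in_genotyping_range` would have to be obtained from SERV; the sketch only shows how a precomputed score might be screened against the recommended range.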
Similar resources
Use of microsatellite markers for molecular characterization of cumin (Cuminum cyminum L.) ecotypes
In this study, Simple Sequence Repeat (SSR) markers were used to investigate the genetic variation among 49 cumin ecotypes collected from 9 different provinces of Iran. SSR primers Elap1479, Elap040 and Elap1493 showed the highest (89%), while Elap1340 and Elap017 showed the lowest (56%) number of polymorphic bands. Polymorphism information content (PIC) values varied between 0.18 and 0.37. The...
Simultaneous estimation of all the parameters of a stepwise mutation model.
Minisatellites and microsatellites are short tandemly repetitive sequences dispersed in eukaryotic genomes, many of which are highly polymorphic due to copy number variation of the repeats. Because mutation changes copy numbers of the repeat sequences in a generalized stepwise fashion, stepwise mutation models are widely used for studying the dynamics of these loci. We propose a minimum chi-squar...
Immortal, telomerase-negative cell lines derived from a Li-Fraumeni syndrome patient exhibit telomere length variability and chromosomal and minisatellite instabilities.
Five immortal cell lines derived from a Li-Fraumeni syndrome patient (MDAH 087) with a germline mutant p53 allele were characterized with respect to telomere length and genomic instability. The remaining wild-type p53 allele is lost in the cell lines. Telomerase activity was undetectable in all immortal cell lines. Five subclones of each cell line and five re-subclones of each of the subclones ...
Whole genome shotgun sequence of Bacillus amyloliquefaciens TF28, a biocontrol entophytic bacterium
Bacillus amyloliquefaciens TF28 is a biocontrol endophytic bacterium capable of inhibiting a broad range of plant pathogenic fungi. The strain has the potential to be developed into a biocontrol agent for use in agriculture. Here we report the whole-genome shotgun sequence of the strain. The genome size of B. amyloliquefaciens TF28 is 3,987,635 bp, which consists of 3754 protein-codin...
Identification of DNA probes that reveal polymorphisms among closely related Phaseolus vulgaris lines
Analyses of genetic diversity within populations could be of great benefit to the conservation of plant genetic resources. To identify genetic markers that are variable within populations, the genome of Phaseolus vulgaris was screened with several DNA sequences for hypervariable sequences. Polymorphisms were observed between Middle American and Andean cultivars using the pr...
Journal title:
- Genome research
Volume 17, Issue 12
Pages -
Publication date: 2007